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Abstract. The MULTICUT IN TREES problem consists in deciding, given a tree, a set 
of requests (i.e. paths in the tree) and an integer k, whether there exists a set of k edges 
cutting all the requests. This problem was shown to be PPT by Guo and Niedermeyer in 
|10) . They also provided an exponential kernel. They asked whether this problem has a 
polynomial kernel. This question was also raised by Pellows in [l]. 
We show that MULTICUT IN TREES has a polynomial kernel. 



1. Introduction 

An efficient way of dealing with NP-hard problems is to identify a parameter which 
contains its computational hardness. For instance, instead of asking for a minimum vertex 
cover in a graph - a classical NP-hard optimization question - one can ask for an algorithm 
which would decide, in 0{f{k).n'^) time for some fixed d, if a graph of size n has a vertex 
cover of size at most k. If such an algorithm exists, the problem is called fixed- parameter 
tractable, or FPT for short. An extensive litterature is devoted to FPT, the reader is invited 
to read [i], [7] and [IS]. 

Kernelization is a natural way of proving that a problem is FPT. Formally, a kernel- 
ization algorithm receives as input an instance (I, k) of the parameterized problem, and 
outputs, in polynomial time in the size of the instance, another instance {I',k') such that 

• k' <k, 

• the size of /' only depends of k, 

• the instances (/, k) and (/', k') are both true or both false. 
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The reduced instance (/', k') is called a kernel. The existence of a kernelization algo- 
rithm clearly implies the FPT character of the problem since one can kernelize the instance, 
and then solve the reduced instance G', k' using brute force, hence giving an 0{f{k) + n"') 
algorithm. A classical result asserts that being FPT is indeed equivalent to having kerneliza- 
tion. The drawback of this result is that the size of the reduced instance G' is not necessarily 
small with respect to k. A much more constrained condition is to be able to reduce to an 
instance of polynomial size in terms of k. Consequently, in the zoology of parameterized 
problems, the first distinction is done between three classes: W[l]-hard, FPT, polykernel. 

A kernelization algorithm can be used as a preprocessing step to reduce the size of the 
instance before applying an algorithm. Being able to ensure that this kernel has actually 
polynomial size in k enhances the overall speed of the algorithm. See for a recent review 
on kernalization. 

The existence of a polynomial kernel can be a subtle issue. A recent result by Fernau et 
al [6] shows that Rooted fc-Leaf Outbranching has a cubic kernel while /c-Leaf Outbranching 
does not, unless polynomial hierarchy collapses to third level, using a breakthrough lower 
bound result by Bodlaender and al [5]. 

In the (unweighted) MULTICUT IN TREES problem, we consider a tree T together with 
a set P of pairs of distinct nodes of T, called requests. Hence, a request can also be seen as 
a prescribed path joining these two nodes. We will often identify the request and its path. 
A multicut of (T, P) is a set S of edges of T which intersect every request in P, i.e. every 
path corresponding to a request contains an edge of S. 

Problem 1.1. MULTICUT IN TREES: 

Input: A tree T = {V,E), a set of requests P, an integer k. 

Output: TRUE if there is a multicut of size at most k, otherwise FALSE. 

Note that a more general presentation of this problem is to assign weights to edges, and 
ask for a multicut of minimal weight. Our technique does not seem to generalize to the 
weighted case. 

This problem appears in network issues (routing, telecommunication, ...). See for a 
survey on multicommodity flow problems and multicut problems. It was shown in [8] that 
Multicut In Trees is NP-complete, and its associated decision problem is MaxSNP-hard 
and has a factor-2 polynomial time approximation algorithm. 

This problem is known to be FPT, see |9j or [10] for a branching algorithm and an 
exponential kernel. The existence of a polynomial kernel was asked in [I]. We verify that 
Multicut In Trees has indeed an 0{k^) kernel. Our reduction is very much inspired 
from |9j and [lO]. In the next section, we first illustrate our techniques when the tree T is a 
caterpillar. In Section 3 we extend the proof to general trees. 

2. A polynomial kernel for caterpillars 

A node of T which is not a leaf is an internal node. The internal tree of T is the tree 
restricted to its internal nodes. We say that T is a caterpillar if its internal tree is a path. We 
consider the restriction of the MULTICUT IN TREES problem to caterpillars, as it contains 
the core of our proof in the general case. 



A POLYNOMIAL KERNEL FOR MULTICUT IN TREES 



185 



Let us give some general definitions which will apply both for the caterpillar case and 
for the general case. 

We say that two nodes x and y are R-neighbors if there exists a request xy. A leaf x 
and an internal node y are quasi- R-neighbors if there exists a request xy, or a request xz, 
where z is a leaf rooted at y. An internal node with no leaf attached to it is an inner node. 
If X is a leaf, we denote by e{x) and call the edge of x the edge adjacent to x. A group of 
leaves is the set of leaves connected to the same internal node. A group request is a request 
xy where x and y belong to the same group. A leaf which is an endpoint of a group request 
is a bad leaf. A leaf to leaf request is a request between two leaves. An internal request is 
a request between two internal nodes. A request between an internal node and a leaf is a 
mixed-request. Two requests are disjoint if their edge sets are disjoint. Two requests x\y\ 
and X2y2 are endpoint- disjoint if xi,yi, X2,y2 are pairwise different. 

The internal path of a request is the intersection between the path of the request and 
the internal tree. The common factor of two requests is the intersection of their paths. A 
request Ri dominates a request R2 if the internal path of Ri contains the internal path of 
R2. 

Contracting an edge e in (T, P) means contracting e in T, and transforming each request 
of the form (ei, . . . , et, e, et+i, . . . , e;) in P into (ei, ■ ■ ■ ,et, et+i, ■ ■ ■ , e;). Deleting an edge e 
means contracting e in T and removing every request containing e from P. 

Two requests of length at least 2 from a given leaf x have the same direction if the 
second edge of their path starting at x is the same. Two requests from an internal node x 
have the same direction if the first edge of their paths (starting at x) is the same. All the 
requests from x have the same direction if they pairwise have the same direction. 

In the following, our instance T is assumed to be a caterpillar. We call the two extrem- 
ities of the internal path the left end and the right end of T. The path between a node x 
and the right (resp. left) end will be called right and left relatively to x. 

Let T' be the internal tree of the caterpillar T. The following five sets partition T: 

• The set Ii of leaves of T' . 

• The set I2 of degree two nodes of T' . 

• The set Li of leaves rooted at Ii. 

• The set L'2 of bad leaves rooted at h- 

• The set L2 of the other leaves rooted at l2- 

The wingspan W oi a leaf x is the path between the closest quasi-R-neighbor on the 
right of X and the closest quasi-R-neighbor on the left of x (if no such neighbor exists, we 
take the father f{x) of x by convention). The size of a wingspan is the number of L2-leaves 
pending from it. The subcaterpillar of the wingspan W consists in W and the leaves rooted 
at W. The wingspan W dominates a request yz if both y and z belong to the subcaterpillar 
ofW. 

The usual way of exhibiting a kernel is to define a set of reduction rules. These rules 
should be safe, meaning that after applying a rule, the truth value of the problem on 
the instance does not change. Moreover the repeated application of the rules should take 
polynomial time. Finally, after iterating these rules on an instance, we want the reduced 
instance to be of polynomial size in k. 

The reduction rules. We apply the following reduction rules to an instance: 

(0) Unit Request: if a request R has length one, i.e. R = e for some edge e of T, then 
we delete e and decrease k by one. 
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(1) Disjoint Requests: if there are k + 1 disjoint requests in P, then we return a trivially 
false instance. 

(2) Unique Direction: if all the requests starting at a leaf x have the same direction, then 
contract e(x). If all the requests starting at an inner node x have the same direction, 
then contract the edge e adjacent to x which does not belong to any request starting 
at X. 

(3) Inclusion: if a request R is included in another request R', then delete R' from the 
set of requests. 

(4) Common Factor: let R he a request. If + 1 requests Ri, . . . ,Rk+i different from 
R but intersecting R are such that for every i j, the common factor of Ri and Rj 
is a subset of R, then delete R from the set of requests. 

(5) Dominating Wingspan: if x is an -L2-leaf with a wingspan dominating at least k-\-l 
endpoint-disjoint leaf to leaf or mixed requests, then contract e{x). 

Each iteration of the reduction consists in applying the first applicable rule, in the above 
order. 

Lemma 2.1. Rules Unit Request, Disjoint Requests, Unique Direction, Inclusion, Common 
Factor and Dominating Wingspan are safe. 

Proof. (0) Rule Unit Request is obvious. 

(1) Rule Disjoint Requests is obvious. 

(2) For Rule Unique Direction, assume first that all the requests from a leaf x have the 
same direction, and that a multicut contains e(x). Let e' be the second common 
edge of all these paths. As e' cuts all the requests cut by e(x), if e{x) is in a solution 
S then 5\{e(x)} U {e'} is also a solution. So we can contract e{x). Now, assume 
that all the requests from an inner node x go to the right. If a solution S contains 
the edge e adjacent to x on the left then 5'\{e} U {e'}, where e' is the right edge 
adjacent to is a solution since a request going through e also goes through e'. 

(3) For Rule Inclusion, observe that an edge cutting R also cuts all the paths containing 
R. 

(4) If there is a multicut of k edges, then one of these edges must intersect two requests 
among the k+1 mentioned in Rule Common Factor. This edge lies in the intersection 
of two paths, hence in R, so request R is cut in any multicut of P \ {R}. 

(5) Let X be an L2-leaf with a wingspan W dominating k + 1 endpoint-disjoint requests. 
If a multicut of size k exists, it contains an edge e which cuts two of these requests. 
As the requests are endpoint-disjoint, their intersection is included in the internal 
tree, hence in W. Assume, for example, that e is on the left of the leaf x. Then all 
the requests from x which go to the left go through e, and moreover x has no group 
request. Thus, if a solution exists, there is a solution without e(x), since e{x) can 
be replaced by the edge e' which is on the right of the neighbor of x. ■ 

Lemma 2.2. Deciding whether a rule applies and applying it takes polynomial time. 

Proof. Denote by n the number of nodes in T and by r the number of requests, which is 
0(n2). 

(0) The application of Rule Unit Request takes time 0(r). 

(1) The maximum edge-disjoint paths problem in trees is polynomial, see [5], thus Rule 
Disjoint Requests is polynomial. 
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(2) Rule Unique Direction can be applied in time O(rn^). 

(3) Rule Inclusion can be applied in time O(r^). 

(4) For the running time of Rule Common Factor, consider a request R. Informally, we 
are looking for a large enough set of requests which intersect R, possibly leaving it 
at one or two places, such that the edges through which they leave are all distinct. 
More formally, let Z be the set of edges not in R but sharing a vertex with some 
edge in R. Let Y be the set of edges e in Z such that there exists a request starting 
at a node in R and going through e. We can assume without any loss that one 
request per such edge e is chosen. Let G be the graph which vertices are Z — Y and 
which edges are the pairs (e, e') such that there exists a request going through both 
e and e' . There exist k + 1 paths as in Rule Common Factor if and only if G has a 
matching of size at least fc + 1 — |y|. As the matching problem is polynomial, the 
application of Rule Common Factor takes polynomial time. 

(5) Let be a wingspan, let G be the graph which vertices are the leaves pending from 
W and where two leaves are adjacent if there is a request between them. There exist 
k-\- 1 endpoint-disjoint requests dominated by W if and only if G has a matching of 
size fc + 1, thus Rule Dominating Wingspan is polynomial. ■ 



Lemma 2.3. The reduction process has a polynomial number of iterations. 

Proof. Each rule decreases the sum of the lengths of the requests, which is initially less than 
the number of requests times the number of nodes. ■ 

In the following we consider an instance in which none of these rules can be applied, 
and prove that such a reduced instance has polynomial size in k. 

Let us introduce two graphs theoretic lemmas which are used in our proof. 

Lemma 2.4. Let G he an undirected graph having m edges, of maximal positive degree A. 

Then G has a matching of size Lf^^J • 

Proof. Such a matching can be obtained by a greedy algorithm, as taking an edge uv in the 
matching forbids the edges adjacent to u and those adjacent to v (there are at most 2 A — 1 
such edges, including uv). m 

Lemma 2.5. Let H he an undirected graph on n vertices, of maximal degree A. Then H 
has an independent set of size L^^J ■ 

Proof. Such an independent set can be obtained by a greedy algorithm, as taking a vertex 
u in the independent set forbids the vertices adjacent to u. m 

Theorem 2.6. The MULTICUT IN CATERPILLARS prohlem has a kernel of size 0{k^). 

The rest of this section is dedicated to the proof of the theorem. 

Observation 2.7. A node has at most k + 1 R-neighbors in each direction. 

Proof. If a node x has k + 2 R-neighbors in, say, the right direction, then Rule Common 
Factor applies to any longest right request of x. ■ 
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Claim 1. There are at most 2{k + l){2k + 1) — 1 bad leaves. 

Proof. A bad leaf is connected to at most k + 1 leaves of some given group, by Rule Common 
Factor. Let G be the undirected graph whose vertices are the bad leaves of T and where 
there is an edge between two leaves if there is a group request between them. The minimal 
degree in G is at least 1, and the maximal degree is at most k + 1. If there are at least 
2{k + l){2k + 1) bad leaves then there are at least {k + l){2k + 1) edges in G. Thus by 
Lemma [2.41 there exist a matching of size k + 1 which implies the existence of A; + 1 endpoint- 
disjoint (thus disjoint) group requests. In this case, Rule Disjoint Requests would apply. ■ 

Claim 2. A wingspan has size at most 2{k + 1)(4A; + 3) — 1. 

Proof. Let be a wingspan. As Rule Dominating Wingspan does not apply, W does not 
dominate k + 1 endpoint-disjoint requests. Let W be the set of leaves pending from W. 
Let G be the undirected graph which vertices are the leaves in W' and the nodes in W. For 
each leaf to leaf request zy such that z and y are in W, create an edge zy in G. For each 
mixed-request zy such that z is in W' and y in W, create an edge zy in G. Finding k + 1 
endpoint-disjoint requests is equivalent to finding a matching of size A; -|- 1 in G. The degree 
of a vertex u in G is at most 2k + 2 because there are at most k + 1 requests in each direction 
for u in T (by Observation 12. 7p . Moreover, if u corresponds to a node of W, the degree of 
u is at least one. Indeed, since the wingspan of x is maximal, each L2-leaf pending from W 
must have a request dominated by W. 

If there are 2{k + 1)(4A: -|- 3) L2-leaves in W' , then G contains at least {k + l)(4/c -|- 3) 
edges, and so G has a matching of size A; -|- 1 by Lemma 12.41 which in turn means the 
existence of k + 1 endpoint-disjoint requests. ■ 

Claim 3. There are 0{k^) L2-leaves. 

Proof. Let rr be a L2-leaf of wingspan W. By the previous claim, there are less than 2{k + 
1)(4A; -|- 3) leaves pending from W. At most 2{k + l)(4fc -|- 3) L2-leaves not pending from 
W have wingspans intersecting W for each direction, as the furthest leaf (on the right) 
of wingspan intersecting W has a wingspan which dominates all other leaves of wingspan 
intersecting W from the right. Let H be the auxiliary graph on L2, where two L2-leaves are 
adjacent if their wingspans intertsect. H has maximum degree less than 6{k + 1)(4A; -|- 3) 
by the above discussion. By Lemma [231 if T has at least 6{k -|- 1)(A; -|- 2)(4A; -|- 3) vertices, 
then H has a stable set of size k + 1. Thus T would have k + 1 disjoint wingspans, and thus 
A: -|- 1 disjoint requests, a contradiction. 

Claim 4. There are 0{k^) /2-nodes. 

Proof. By Claim [U there are 0(k'^) /2-nodes with leaves. Let us bound the number of inner 
nodes. Let I' be the set of inner nodes in T. Consider the graph G on the set of vertices /' 
where there is an edge xy if xy is a request in T. 

Because of Rule Inclusion, each inner node has degree at most two in G (one in each 
direction). Thus G is a disjoint union of paths, called request paths. The length of a request 
path is at most k by Rule Disjoint Requests. A node with degree 1 in G is an extremal inner 
node. 

Each extremal inner node must be an R-neighbor in T of a leaf or of an internal node 
with a leaf (otherwise it would be reduced by Rule Unique Direction). Denote by X the set 
of leaves and internal nodes with a leaf attached to it. Each node in X has 0{k) R-neighbors 
among the inner nodes, and \X\ = 0{k^), so there are 0{k'^) inner nodes with a neighbor in 
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X (in particular, at most 0(/c^) extremal inner nodes). Each extremal inner node belongs 
to a unique request path of size at most k. Moreover each inner node with no neighbor in 
X must belong to a request path. So there are 0{k^) inner nodes in T. ■ 

There are 0{k^) leaves and 0{k^) internal nodes in a reduced instance. Thus the 
MULTICUT In Caterpillars problem has a kernel of size 0{k^). ■ 

3. General Trees 

Should no confusion arise, we retain the terminology of the previous section. 
Let (r, P, k) be an instance. Let T' be the tree obtained from T by deleting the leaves. 
We partition the set of nodes of T into the following seven sets: 

• The set Ii of leaves in T' . 

• The set I2 of degree 2 nodes in T' . 

• The set I3 of the other nodes in T' . 

• The set Li of leaves rooted at Ii. 

• The set L2 of leaves rooted at I2, endpoint of no group request. 

• The set L'2 of leaves rooted at I2, endpoint of at least one group request. 

• The set L3 of leaves rooted at I^. 

We also denote by / the set of internal nodes of T, and by L the set of leaves of T. 

We need a few technical definitions. A caterpillar of T is a maximal connected compo- 
nent of T — 73 — L3. The backbone of a caterpillar is the set of internal nodes of T in this 
caterpillar. A caterpillar C is non-trivial if the set of internal nodes in C seen as a caterpillar 
has size at least two. The extremities of a non-trivial caterpillar C are the two nodes of C 
which are I2 or /i-nodes of T and become 7i-nodes in C. A minimal request of a node x is a 
request having x as an endpoint and which internal path is minimal for inclusion among all 
internal paths of requests with endpoint. If several requests have the same internal 

paths, we arbitrarily distinguish one as minimal and will not consider the others as minimal. 
If xy is a minimal request of x then y is called a closest R-neighbor of x. 

Let X and y be nodes in T. If z lies on the path between x and y, or is a leaf rooted at 
the path between x and y, we say that z lies toward y from x (and we do not write "from 
x" should no confusion arise). 

Assume x is an L2-leaf of a caterpillar C (that is, an L2-leaf of T which belongs to C). 
Let f{x) be the node from which x is pending. Let Gr{x) be the group of leaves pending 
from f(x). Let A(x) and B{x) be the two connected components of T — {/(x)} — Gr(x). Let 
a{x) (resp. b{x)) be the extremity of C in A{x) (resp. B{x)). II A{x) (resp. B{x)) contains 
no extremity of C, that is if f{x) is an extremity of C, then we define a{x) = f{x) (resp. 
b{x) = f{x)). A wingspan of x is formed by the restriction to internal nodes of the union 
of two requests between x and two of its closest R-neighbors lying respectively in A{x) and 
B{x). Observe that x can have several wingspans. The subcaterpillar of the wingspan W 
consists in W and the leaves rooted at W. 

An L2-leaf x covers a caterpillar C if either x ^ C and there is a request starting at x 
and going through the whole backbone of C, or if x G C and there are two minimal requests 
starting at x which together cover the whole backbone of C. 

We apply the following reduction rules to an instance: Rules (0), (1), (2), (3), and (4) 
are stated in the previous section. Rule Dominating Wingspan is split for convenience into 
two rules, one similar to the caterpillar case and a more general one, as follows: 
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(5a) Bidimensional Dominating Wingspan: if .r is an L2-leaf of a caterpillar C with a 
wingspan W such that W (IC dominates at least k + 1 endpoint-disjoint requests, 
then we contract e{x). 

(5b) Generalized Dominating Wingspan: assume that x is an L2-leaf of the caterpillar C, 
and that x covers C. Assume that for every closest neighbor z of a: in A{x), there 
exist A; + 1 endpoint-disjoint requests between a node lying toward b{x) from x and 
a node toward z from a{x). Then we contract e(x). 

Each iteration of the reduction consists in applying the first applicable rule, in the above 
order. 

Lemma 3.1. Rules (5a) and (5b) are safe. 

Proof. Safeness of Rule Bidimensional Dominating Wingspan follows from the safeness proof 
of Rule Dominating Wingspan in the previous section. 

Assume Rule Generalized Dominating Wingspan can be applied to x. Let zi,...,zi 
be the closest R-neighbors of x in A{x). For every i G {!,...,/}, because of the k + 1 
endpoint-disjoint requests mentionned in the rule, any A;-muIticut contains an edge in the 
path between Zi and b{x). Assume that a fc-multicut S contains an edge e" between x and 
b{x). Let e' be the edge adjacent to e(x) in the path between x and a(x). If S contains 
e(.T), then S — {e{x)} U {e'} is also a fc-multicut. Indeed, any request x,u with u G A{x) is 
cut by e', and any request x,ij with v £ B{x) is cut by e" . Assume now that a /c-multicut S 
contains no edge between x and 6(x), then for every i G {1, ...,/}, 5 must contain an edge 
Cj in the path between Zi and f{x). Let e' be the edge adjacent to e{x) in the path between 
X and b{x). If S contains e{x), then S — {e{x)} U {e'} is a fc-multicut. Indeed, any request 
x,u with u G A{x) is cut by an edge Cj, and any request x,v with v G B{x) is cut by e' . m 

Proposition 3.2. The repeated application of these rules on the instance until none can be 

applied takes polynomial time. 

Proof. The proof of the first five cases was made for general trees in the previous section. 
The polynomiality of Rule Bidimensional Dominating Wingspan follows from the proof of 
Rule Dominating Wingspan's polynomiality in the previous section. Deciding whether there 
exist k + 1 endpoint-disjoint requests between prescribed areas can still be expressed as 
a matching problem as in Rule Dominating Wingspan's proof, so the application of Rule 
Generalized Dominating Wingspan also takes polynomial time. ■ 

Theorem 3.3. The number of nodes in a reduced instance is 0{k^). 

The rest of this section is devoted to the proof of this theorem. 

Claim 5. = 0{k) 

Proof. There are at most k groups of leaves with a group request, by the A; -|- 1 disjoint 
requests rule. Every group of Li-leaves has a group request, otherwise any leaf of this group 
would be deleted by Rule Unique Direction. Every /i-node has at least one Li-leaf pending 
from it, thus < A;. ■ 

Claim 6. I/3I = 0{k) 

Proof. In a tree, there are at most as many nodes of degree at least 3 as the number of 
leaves, so I/3I < < A:. ■ 



A POLYNOMIAL KERNEL FOR MULTICUT IN TREES 



191 



Claim 7. |Li[ = 0{k'^) and = 0(A;2) 

Proof. Each leaf in Li is a bad leaf by Rule Unique Direction, and each leaf in L2 is bad 
by definition. As in Claim [1] there are at most 2{k + l){2k + 1) — 1 bad leaves in T. Thus 
|Li UL'al = 0(A:2) ■ 

We now show that: 
. IL3I =0(fc4) 

• IL2I =0(fe4) 

• I/2I = 0(A:6) 

Claim 8. The number of requests from a node 2; to a group of leaves is at most k + 1. 

Proof. Otherwise Rule Common Factor would apply to these requests. ■ 

Claim 9. The number of requests from a node x to all the -L2-leaves in a given caterpillar 
C is at most 2k + 2 if x e C and k + 1 if x ^ C. 

Proof. Otherwise there would be at least k + 2 requests sharing the same direction between 
X and leaves in this caterpillar, and Rule Common Factor would apply to these requests. ■ 

Claim 10. There are at most {2k + + 2) — 1 requests between two groups of leaves. 

Proof. Let G be the bipartite graph which vertices are the leaves of the two groups Y and 
Z, and where a leaf in Y and a leaf in Z are adjacent if there is a request between them. 
The maximum degree in G is at most A; + 1 by Claim [8l thus if there are {2k + 1)(A; + 2) 
requests between Y and Z, then by Lemma [2]4] there would be a matching of size A: + 2 in G. 
Thus there would he k + 2 endpoint disjoint requests between Y and Z, and Rule Common 
Factor would apply. ■ 

Claim 11. The number of requests between a group of leaves E and the nodes in a given 
caterpillar G is at most 2(2A; + l){k + 2) - 2. 

Proof. Assume by contradiction that there are at least 2(2A; + l){k + 2) — 1 such requests. 
Let / be the node in which the leaves of E are rooted. If / belongs to C, then G — f has 
two connected components. Among these two components, we select the component G' in 
which there is the largest number of requests from E. If / does not belong to C, then we 
let C" = G. There are at least {2k + l)(/c + 2) requests between G' and E. Consider the 
undirected (bipartite) graph G which vertices are the leaves of E and the nodes of C", and 
where there is an edge between a leaf from E and node from G if there is a request between 
them. This graph has maximum degree + 1 by Rule Common Factor, thus by Lemma [2^ 
G has a matching of size k + 2. Thus there would be A; + 2 endpoint disjoint requests, and 
Rule Common Factor would apply to them. ■ 

Claim 12. There are at most 2k — 1 caterpillars in T. 

Proof. There are at most 2k nodes in Ii U I3. Let us call them separating nodes. Let r 
be one of these separating nodes. Let us consider r as the root of T. Each caterpillar 
is adjacent to exactly two separating nodes. Let us associate to each caterpillar of T its 
adjacent separating node further away from the root r. This mapping is a bijection, and no 
caterpillar is mapped on r, thus there are at most 2A: — 1 caterpillars. ■ 
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Claim 13. 

\L3\ = 0{k^) 

Proof. We have that II3I = 0{k) by Claim [6l Let X be an Ls-group rooted in y G I3. 
Because of Rule Disjoint Requests, at most {2k + l)(/c + 1) — 1 leaves in X are endpoints 
of group requests (by Lemma 12.41 on the usual auxilliary request graph on X). Each leaf 
of X must be the endpoint of at least one request, so let us count the maximal number 
of requests contributed by each type of nodes. By Claim [TOl and as there are at most k 
groups of Li-leaves and k groups of La-leaves, at most k{{2k + l){k + 2) — 1) leaves of X 
have a request toward an Li-leaf or an La-leaf. There are at most 2k — 1 caterpillars in T 
by Claim [T2j and leaves in X have in total at most 2(2A; -|- 1)(A; -|- 2) — 2 R-neighbors in 
any caterpillar by Claim [TTl Thus 0{k^) leaves in X are endpoints of a request toward a 
caterpillar node, and I3 nodes can contribute for at most 0{k'^) requests, so |X| = 0{k^). 
This gives \L3\ = 0{k'^). m 

Claim 14. 

IL2I = 0(A;4) 

Proof. Assume by contradiction that IL2I > 3{2k - l){k + l){k + l)(4fc + 3). Let C be a 
caterpillar of T containing the maximum number of L2-leaves. By Claim [T2l there are at 
most 2k — 1 caterpillars in T, thus C contains at least 3(A: -|- 1)(A; -|- 1)(4A: -|- 3) L2-leaves. 

Assume first that C is not covered. We obtain a contradiction as in the caterpillar 
case. Consider x to be the L2-leaf having a wingspan which intersection W with C has 
maximal size. Let C be the subcaterpillar of backbone W. Then C contains at least 
{k + l)(4/c -|- 3) L2-leaves, otherwise one would find k + 1 disjoint wingspans by taking W, 
then a Wi disjoint from W , then a W2 disjoint from W and Wi, . . . , and finally a 
disjoint from W, Wi, . . . , Wk-i, as in Claim [3l Note that the caterpillars W, Wi, . . . , Wk 
are disjoint, as their intersections W, Wi, . . . , Wk with C are disjoint and non-empty. Thus 
there would he k + 1 disjoint requests, a contradiction. Since W is maximal, each L2-leaf y 
in C is the endpoint of a request r C C". The existence of {k + l)(4/c -|- 3) such leaves means 
there are at least k + 1 endpoint-disjoint requests dominated by W, by Lemma [2^ applied to 
the usual auxiliary request graph G on the L2-leaves of C (note that the maximum degree 
of G is at most 2k + 2). Which means Rule (5a) should apply, a contradiction. 

Assume now that C is covered by some L2-leaf x. If more than {k + l){4:k + 3) L2-leaves 
in C do not dominate C, then some wingspan of x dominates {k + l)(4/c -|- 3) requests, 
and thus dominates at least k + 1 endpoint-disjoint requests, by the usual application of 
Lemma 12. 4[ So Rule Bidimensional Dominating Wingspan should apply, a contradiction. 
So at least 3{k + l){k + l){Ak + 3) - {k + l){4k + 3) L2-leaves in G cover C, let X be the 
set of these leaves. Let di, . . . ,dj be the Ii-nodes in A{x). Note that j < k. 

For such an Ii-node di and a leaf x & X having at least one quasi- R-neighb or lying 
toward di, let us denote by rn{x,i) the closest quasi-R-neighbor of x toward di. Let RN{i) 
be the set of all nodes rn{x, i) for leaves x ^ X having at least one quasi-R-neighbor lying 
toward di. Note that the nodes of RN{i) lie on the segment [a{x),di]. Denote hy x\,. . . ,x\ 
the leaves in X having at least one quasi-R-neighbor lying toward di, ordered according to 
the distance between a(x) and rn{x,i), from closest to furthest. If t > (A; -|- 1)(4A; -|- 3), 
denote by Xi the set {x\, . . . ,x\^k+i){4k+3)}- 

When less than {k + l)(4/c -|- 3) L2-leaves in X have a quasi-R-neighbor toward di, mark 
di as invalid, and proceed. Note that at least one di must be valid, as |X| > k{k + l){4:k + 3). 
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Now we have a list of at most k sets (the sets Xi for di vahd) of size + l)(4A; + 3). The 
union X' of these is of size at most k{k + l)(4/i; + 3) < |^|. Thus there exists an -L2-leaf z in 
X — X' . Consider the closest quasi- R-neighbor of z toward a valid di. There are either 
(A;-|-l)(4A;-|-3) L2-leaves of Xj between z and a{x) or (A;-|-l)(4A;-|-3) L2-leaves of Xi between z 
and h{x). Thus there are k + 1 endpoint-disjoint requests either between the subcaterpillars 
spanned by the segments ]z,a(x)[ and ]a{x),ni[ or between the subcaterpillars spanned by 
the segments ]6(x),2;[ and ]a(x),ni[, by Lemma [2^ on the usual auxiliary request graph. In 
the former case Rule Common Factor applies, in the latter Rule Generaliz;ed Dominating 
Wingspan applies. ■ 

Claim 15. 

I/2I = 0(A;6) 

Proof. There are 0(/c^) internal nodes with leaves in T, by Claim [TH It remains to bound 
the cardinal of the set Z of inner nodes in I2. 

Let r be an Ii-node of T, we now consider r as the root of T . Let u be a node of Z . 
Let C{u) be the caterpillar containing u, denote by a{u) and h{u) its extremities, with h{u) 
an ancestor of a{u) with respect to r. Let A{u) be the connected component of T — {u} 
containing a{u). If the node u has an R-neighbor in A{u), select such node v{u). Note that u 
is on the path bewteen v{u) and r. Thus, by Rule Inclusion, v{u) 7^ v{u') whenever u 7^ u' . 
Let G be the graph with vertex set Z, and with edge set {{u,v{u))\u G Z}. This graph G 
is a disjoint union of paths. By Rule Disjoint Requests, paths in G have length at most k. 
Vertices u in G which have no R-neighbor in A{u) must be adjacent in T to some node not 
in Z, by Rule Unique Direction. There are 0{k^) nodes not in Z, each of which can have 
at most k R-neighbors in Z . Indeed, a vertex cannot have two different R-neighbors in the 
same direction, by Rule Inclusion. Thus there are 0{k^) vertices u without R-neighbor in 
A{u) in G, which gives that there are 0{k^) vertices in G, which finally means that there 
are 0{k^) inner nodes in T. □ 

This concludes the proof of the theorem. ■ 



4. Conclusion 

We have shown that the (unweighted) MULTICUT IN TREES problem admits a polyno- 
mial kernel. This kernelization algorithm, or just some particular sequence using some of 
the reduction rules presented above, can be used as a preprocessing or in-processsing step 
in a practical algorithm. 

This analysis might not be tight, so one can hope to improve this 0{k^) bound retaining 
the same set of reduction rules. New reduction rules might be needed to decrease this bound 
even further. 

Our technique does not seem to generalize to the weighted version of MULTICUT IN 
Trees. Thus deciding whether the Weighted MULTICUT IN TREES problem admits a poly- 
nomial kernel is still open. 

It is not known whether the general Multicut in Graphs problem is FPT with respect 
to this parameter fc, even for graphs of bounded treewidth. If it turned out to be true, then 
the question of the existence of a polynomial kernel for Multicut in Graphs would rise. 

Among the most notorious open problems on polynomial kernelization stand Directed 
Feedback Vertex Set and Clique Cover. Directed Feedback Vertex Set consists in deciding 
whether a graph admits k vertices which removal makes the graph acyclic. This problem 
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was shown to be FPT in [2]. Clique Cover consists in deciding whether the edges of a graph 
can be covered by at most k chques. 
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